Optical telecom networks as weak quantum 
measurements with post-selection 

Nicolas Brunner 

Group of Applied Physics, University of Geneva 

Diploma thesis 

Under the direction of N. Gisin and V. Scarani 
February 1, 2008 



1 



1 Introduction 



In this work we establish a hnk between two apparently unrelated subjects: polarization effects 
in optical fibers and devices, and the quantum theory of weak measurements [J. We show that 
the abstract concept of weak measurements followed by post-selection, introduced a decade ago by 
quantum theorists, naturally appears in the everyday physics of telecom networks. 

Our analogy works as follows. First, polarization mode dispersion (PMD) [2j performs polar- 
ization measurements by spatially separating the fiber's eigenmodes. It turns out that the usual 
telecom limit for PMD, where dispersion has to be minimized, corresponds to the quantum regime 
of weak measurements. Then polarization dependent losses (PDL) |2] perform post-selection in 
a very natural way: one post-selects those photons that have not been lost. This is non-trivial 
physics since the losses depend precisely on the measured degree of freedom: the polarization of 
light. In case of an infinite PDL (i.e. a polarizer) the post-selection is done on a pure state. For 
a finite PDL, the post-selection is done on a mixed state. Thus the amount of PDL characterizes 
the kind of post-selection involved. 

We show also that the quantum formalism of weak measurements can simplify some "telecom" 
calculations and gives a better understanding of the physics of networks. A telecom network can 
be described as a concatenation of elements with PMD (fibers) and some with PDL (couplers, iso- 
lators, etc). A simple formula for the mean time-of- arrival for an arbitrary concatenation of PMD 
and PDL elements is derived. 



2 Polarization mode dispersion 

Polarization mode dispersion (PMD) is the most important polarization effect in optical fibers. It 
is due to the birefringence of the fiber. In standard optics, a fiber is represented by a channel 
supporting two polarization modes. The main consequence of PMD, is that the time of fiight along 
the fiber will be different for each mode. 

Before going into calculations we would like to clarify the notations. We use the formalism of 
two dimensional Jones vectors to describe polarization. In this representation a classical state of 
polarization is equivalent to a quantum spin ^. The three typical pairs of polarizations - horizontal- 
vertical linear, diagonal linear, left-right circular - are described respectively by the eigenvectors of 
the Pauli matrices 



= U i = U ] = U -1 • 



We mostly use the eigenstates of Uz 



az\H) = \H) and az\V) = -\V) (2) 

A general pure state of polarization is a complex superposition of these two states. The state 
\ + H) = cos{9/2)\H) + sin(^/2)e*'^|y) corresponds to the point h = {9, ip) on the Poincare sphere. 

Let's consider a PMD fiber of length L with birefringence vector ^. We define the fiber's axis 
to be the z direction. So \H) and \V), the eigenstates of az, are the eigenmodes of the fiber. We 
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consider a polarized gaussian pulse with coherence time tc- This pulse can be thought of as a 
classical light pulse or as a quantum single photon non-monochromatic state. Taking into account 
both energy and polarization degrees of freedom, the complete initial state reads 



where a, (3 arc complex numbers and satisfy |ap + = 1. In this equation, as usual, the gaussian 
function g{t) is normalized so that 

G(i)-5(t)^ = ^e-*V2*? (4) 

is a probability distribution. Our coordinate system t corresponds to the time-of-arrival of the pulse 
and travels at the speed of light in the fiber (without PMD) Vf = c/n, where n is the refractive 
index in the fiber. A pulse centered at i = propagates at speed Vf. 

PMD is a unitary operation represented by the operator 

i7(e,i) =e^5--^ =cos(|^)a + zsin(|^K • (5) 

where ^ = || ^ ||. To compute the evolution for each eigenmode of the fiber, we have to Fourier- 
transform the input state g{t) \H, V) into the frequency domain, apply the PMD operator to any 
monochromatic component, and integrate back to the time domain. Note that by \H, V), we mean 
\H) or \V). Thus we have 

U{^, z) git) ® \H, V)= j (kjg{uj)e-^* e'i'^"' \H, V) (6) 

where g{oj) is the Fourier-transform of g{t). Since our gaussian pulse is centered in frequency ujq 
we apply the following change of coordinate u ^ u — ujq. We have 

C/(^, z) g{t) ®\H,V) = j dugiu - coo)e-'^'^-'^°^^ e'i'"'^ \H, V) 



I' 



= g{t^^)®e^''^-°\H,V). (7) 

The above equation shows that the mean time of arrival for each eigenmode is shifted by the same 
quantity (^/2) in either direction. Usually the time separating the mean time-of-arrival of the two 
modes is called the differential group delay (DGD) and noted 5t . Thus the output state reads 

|*pmd) = ag^{t)\H) + Pg^{t)\V) , (8) 
where we have omitted the tensor products and defined 

a^e*^-°a , /3 ^ e"*^-"/? , g^{t) ^ g{t ± ^-^) . (9) 
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So the effect of PMD is a spatial separation of the fiber's eigenmodes \H) and \ V) combined with a 
rotation of the polarization around the z axis. This global phase is irrelevant for the link we want 
to establish and will therefore always be absorbed in the polarization state 



-''^m = a\H) + P\V) . (10) 
Note that \H) and \V) are respectively the fastest and slowest polarization modes in the fiber. 



The measurement analogy now goes like this. Different polarization modes will arrive at differ- 
ent times. Thus by measuring the mean time-of-arrival of a pulse one can obtain some information 
about its state of polarization. Depending on whether the delay between the two polarization 
modes is large (small) compared to the coherence time of the pulse, the measurement of polar- 
ization achieved by PMD is strong (weak) (see Fig. In telecom optics the interesting limit is 
always the second one since dispersion has to be minimized. 



a) 





Figure 1: PMD as a polarization measuring device, a) the measurement is strong: the time-of- 
arrival gives full information about the polarization of the pulse, b) the measurement is weak: no 
discrimination possible. 

In standard quantum mechanics, a measurement is an interaction between two systems: the physical 
system to be measured and the measuring device, usually called a pointer [3]. The change in the 
position of the pointer, due to the measurement, provides some information about the state of 
the measured system. In our case the energy degree of freedom of the pulse plays the role of the 
pointer, while the polarization is the measured system. Thus pointer and measured system are here 
different degrees of freedom of the same physical system. PMD provides the necessary interaction 
between the pointer and the measured system since energy and polarization degrees of freedom 
are entangled in the output state (jHJ. However, PMD alone, without the final measurement of the 
time-of-arrival, is not a measurement since it is a reversible operation. 
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3 Polarizer: post-selection on a pure state 



3.1 Strong measurement 

Our aim is to describe networks. A network is an arbitrary concatenation of PMD and PDL 
elements. In this section we study the simplest one: a PMD fiber followed by an element with 
infinite PDL, i.e. a polarizer. Note that this setup corresponds to the usual scenario in which weak 
measurements are studied, since the system is pre- and post-selected. Here the post-selection is 
done on a pure state, since the polarizer projects onto a linear polarization 

\<P)=lJi\H) + u\V). 
At the output, we measure the time-of-arrival. So, the final state is 



1^ 



out 



\4>)m 



PMD 



afig^{t) + f3iyg+{t) 



Fit) 



(11) 



Note that |-F(t)p must be normalized in order to be a probability distribution. As we have seen 
above, because of PMD some information about the preparation is contained in the shape, F{t), 
of the output pulse. 



When 5t >> tc the two gaussians do not overlap, g-{t)g+{t) 0, and it is possible to discriminate 
between the two polarization eigenmodes. The detected intensity corresponds to two well-separated 
gaussians. From we have 



I{t) = = \afl\'^G^{t) + \Pu\'^G+{t) = 

= Prob((/.|F)Prob(F|V')G-(0 + Prob(0|y)Prob(y|V)G_(t) (12) 

where we have introduced obvious notations like Prob(-ff|V') = jap etc. The probability that the 
polarization was \H), given the preparation and the post-selection, is, in the limit 5t >> tc 



VrnUm = /o°°-^(^)^^ = Prob(<A|g)Prob(ff|V;) 

^ > JZomdz Ex=HyProb(<A|K)Prob(K|V^)- ^ ^ 

This is exactly the Aharonov-Bergmann-Lebowitz (ABL) rule, which is in fact the classical rule for 
the probability of sequential events. Of course, Prob(y) = 1 — Prob(-ff). One can also compute 
the mean time of arrival 

(t)=Prob(i/)y+Prob(y)(-^). (14) 
Since the mean value of az is simply (cr^) = Prob(ff) — Prob(y), we have 

it) = Y (CT.) . (15) 

The interpretation of equation ()15|1 is a key point of our work and needs to be carefully explained. 
In quantum-mechanical terms, cj^ is the observable that is measured by the time delay 6t, and 
(|15jl shows that its mean value is associated to the mean time-of-arrival {t) in the case of a strong 
measurement. What happens now when the measurement is weakened? The point is that, in 
contrast to Prob(-ff) and Prob(y), the mean time-of-arrival is a physical quantity that can be 
defined and measured in any situation (for a strong or weak measurement, with or without post- 
selection) . We admit that is the definition of the mean value of az when measured by introducing 
a time delay 6t between \H) and \ V). 
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3.2 General measurement 



We can now remove all assumptions on the strength of the measurement and derive an analytical 
formula for the mean time-of-arrival (t). Lets consider again the output state Without any 

assumption on the gaussians, the intensity is now 

lit) EE \F{t)\^ = \A\^G^it) + \B\''G+it) + 2Re{AB)g^{t)g+{t), (16) 
where A = ap, and B = (3v. The mean time-of-arrival is given by 

... roptim m^-wn 

S^^im \A\^ + \B\^ + 2Re{AB)j^^g.{t)g+{t)df ^ ^ 

We evaluate separately the remaining integral 

g.{t) g+{t) dt = e-i(^-/2*^)^ ^= / e~'''^'' dt = g-^^^^/^*^)' . (18) 
D tf^ylix J —oo 

^ V ' 

=1 

So finally we find 

m = ^ i\A\'-m (19) 

An important feature of equation (|19)) is that the dependance in the strength of the measurement 
(i.e. in dr/tc) is very explicit. Of course in the limit Sr/tc oo, corresponding to a strong mea- 
surement our previous result is recovered. 

Since equation (|19)) is completely general we can compute the mean time-of-arrival in the case 
of a weak measurement, corresponding to the telecom limit of PMD. When 6t « tc equation (|19|) 
becomes 

" T \A\^ + \B\-^ + 2Re(AB) ~ T ^""^aTb^ ^^"^ 

Note that 

A±B = aM±^={<*j*^,j . (21) 

Using (fT3|) . we find 

which is exactly the weak value of cj^ when the post-selection is done on a pure state {(j)), according 
to Aharonov and Vaidman pQ. Note that {(Tz)w can reach arbitrarily large values, leading to an 
apparently paradoxical situation since the eigenvalues of cj^ are ±1. But there is no paradox at 
all since {(7z)w > 1 simply means {t) > This situation is reached by post-selecting on a state 
1^) nearly orthogonal to These are very rare events; the shape F{t) of the pulse is strongly 
distorted, and it is not astonishing that its center of mass could be found far away from its expected 
position in the absence of post-selection. 
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4 PDL: post-selection on a mixed state 



4.1 General measurement 

We now go one step further into the description of a general network and replace the polarizer of 
the previous section by an element with finite PDL, for example a coupler, an isolator, an amplifier 
or a circulator. Neglecting a global attenuation, PDL is represented by a non-unitary operator 



F{^, n) = e'^""/^ ^ cosh(|)l + sinh(|)a„ (23) 

where an = n - a. The most and least attenuated states, respectively | — n) and \ + n), are orthog- 
onal. The attenuation between them, expressed in dB, is 10 log^Q(e^^). Mathematically speaking 
the PDL operator is not a projective measurement, but a more general operation called a POVM 
[3]. In quantum theory, it is usually called a filter, for example in the unambiguous discrimination 
of non-orthogonal quantum states • 

As in the previous section, we derive now a formula for the mean time-of-arrival. To simplify 
the notation we define F = F{ii,ri). The output state is now 



l^^out) = F\^pmd) = A{t)\H) + B{t)\V) , (24) 

where 

A{t) = {H\F\H)ag^{t) + {H\F\V)pg+{t) = {C + n,S)ag.{t) + SnJg+{t) (25) 

B{t) = {V\F\V)h+{t) + {V\F\H)ag^{t) = {C-n,S)Pg+{t) + Sn+ag^{t) (26) 

with C = cosh(^), S = sinh(^) and n± = n^; ± iuy. We compute the output intensity 

lit) = \A{t)\'^ + |S(i)|2 = |a|2(cosh/x + n^sinh(/^))G_(t) + |/3|2(cosh/i - sinh^)G+(t) 

+ 2sinh/xi?e(d/3n+e*^™°)5+(t)5_(t) . (27) 

Using again (|18|) the mean time-of-arrival is 

JZo Ht)dt 2 1 + ^ K(|a|2 - |/3|2) + 2i?e(a^n+e^'5™o)e-^(5-/2*c)2] ' 

where we have defined 7 = tanh /j,. In the limit of a weak measurement and using equation (|15() we 
find 



1 + jh-a V (V^lFtFlV) ^ V {ij\F^\ij) J' ^ ' 

where we have used the fact that F is self-adjoint, i.e. -F^ = F. This is exactly the expression 
given by quantum theorists for the mean value of cr^ when post-selection is done on the mixed state 
p = ^J-^.^^ F'^. Note however that when a photon comes out, it is left in the pure state F\'iIj). So 
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the meaning of post-selection on a mixed state has to be explained. In the theory of weak measure- 
ments, the state of the system at the time of the intermediate weak measurement is determined by 
two different pieces of information: one coming from the past, i.e. the state in which the system 
was pre-selected, and one coming from the future, the post-selected state. In our case this second 
piece of information is a mixed state since it corresponds to having the identity evolve back through 
the system, i.e. the mixed state p. 

It is interesting to work out the limiting cases of equation (|29j) . When there is no PDL at all, 
7 = and we recover {az)w = Wz)^ since there is no post-selection. For 7 = 1 which corresponds 
to an infinite PDL (/i — > 00) we recover our previous result for the post-selection on a pure state 
(El, with |(/)) = \+ n). 

4.2 Anomalous dispersion and principal states of polarization 

As (cj^) u, can be larger than one we recover anomalous dispersion, which was one of the major 
results of combining the effects of PMD and PDL [21. From (|29j) the maximum (and minimum) 
mean time-of-arrival can be computed. The largest value is obtained when PDL is orthogonal to 
PMD, say h = x. Varying over the input polarization |^) = a\H) + /^IV), we find 

6t 1 

max/mm{„,/3} {t) = ± — -j=== , (30) 

which corresponds to the results of Note however that the factor 7 appearing in this last 
equation has a completely different meaning in |2j. It is defined as the overlap of the principal 
states of polarization (PSP), which are the polarization states such that the output polarization 
is independent of the optical frequency in first order. The concept of PSP plays a key role in the 
usual PMD-PDL theory. It is quite interesting to see that we recover these states in the quantum 
approach. In fact they are simply the PMD fiber's eigenmodes after their evolution through the 
setup, i.e. F\li) and F\y). One can easily check that their overlap is equal to 7 and 

that their mean time-of-arrival is ^5t /2. 

5 Quantum formalism for optical networks 
5.1 General network 

PMD PDL PMD PMD PDL 

^. 4 Fu 

Figure 2: General optical network. A concatenation of PMD and PDL elements. 

In this section we show that our work is more than a beautiful analogy and that the quantum 
formalism can greatly simplify some telecom calculations. We compute the mean time-of-arrival 
for an arbitrary concatenation of PMD and PDL elements (see FigH). Each PMD section is 
characterized by a birefringence vector its norm being equal to the DGD, and its direction 
specifying the measured observable ^ ^„ • = (T„ . Each PDL element is represented by an operator 
Fn- Our input state is again a polarized gaussian pulse \^in) = \g{t)) ® of central frequency 
with coherence time tc- Considering energy and polarization degrees of freedom the output state is 
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j=N 



-"^^ \g{t)) ® \^) . 



(31) 



The product operation has to be carefuhy defined since operators Fj do not commute. We use the 
notation 



N 



Y[Fj = FiF2---FN and Y[ F^ = F^F^-i ■ ■ ■ Fi 



(32) 



j=N 



As it was shown in section 2, the effect of each PMD section is a spatial separation of the fiber's 
eigenmodes and a global rotation around the fiber's axis. Since both operations commute we can 
rewrite 



(33) 



j=N 



j=N 



where we have defined Fj = Fj e^'^"~"^ . Note that F is not self-adjoint. Since we work in the 
telecom limit of PMD, which we have shown to be equivalent to the regime of weak measurements, 
all DGD's (5r„ are assumed to be small compared to the coherence time tc of the optical pulse. 
Therefore the exponential in the above equation can be expanded since 5uj5Tj = 5Tj/tc << 1. We 
have 



1^- 



5t^ 



out! 



n (1 + i5u:^aj) \g{t)) ® 



j=N 



N 



5t. 



( n ^i) + ^'^^ E ^^ivi^7V-i • • • FjajFj^i • • • Fi 



j=N 



\g{t)) 



(34) 



We now write the state in a well-chosen polarization basis, namely { nj=Ar Fjlip) , |orthog)}, where 
|orthog) is defined by (orthog| Ylj=N Fj\tp) = . Note that since Fj are non-unitary operations, the 
state Y\j=j\i Fj\ip) is not normalized. We write its norm B = (^/;| YljLi Pj Y\j=]\[ Fj\ip) . In equation 
(|34)1 we insert the completeness relation 



-j^ 1 N 

1 = "52 n n + I orthog) (orthogi 

^ j=N j=l 



(35) 



so that 



1^' 



out/ 



h n 

j=N 



+ I orthog) 



N / I 

i 

j=l ^ j=N 

1 



N 



5Ti 



I[^H^Ii^j) + i^^T.^^N---F,a,F,_,...F, 



N 



5Tn 



(orthogi ( n Fj) + E -^^^ • • • ^i^i^J-i • • • ^1 



j=N 



\9{t)) 



\g{t)) IV-) 
E> IV-) . (36) 
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The probability of finding the photon in the state |orthog) is of order 0{duj5Tj)'^ , and therefore 
negligible. So whenever the photon manages to come out it is left in the pure state 



1^- 



out/ 



1 

1 + i6ujW ' 



git)) m F,\i;)) 
j=N 

e'''^'^ (m) ill F,m 

j=N 



attenuation T"""! TZ 

normahzed state 



where 



j=l " KV\ i\.j=\ j i\.3=N^3 

So the mean time-of-arrival is 



(37) 



_ f (^1 nf=i Fj ng;v h 03 nL, m) 
h 2 (V'inf=ii^/nU^.i^) 



(39) 



Note that the state ()37() is not normalized because of the losses in the system, and that the 
attenuation depends on the central frequency of the pulse wq- Expression may seem quite 
cumbersome but can be rewritten in a far more intuitive way. We define the following notation 



•"(S.I«-)-^-(fRf)- (-) 

So w(^, |V'))P) is the mean time shift of a PMD section with birefringence vector ^ when the input 
state is \'^) and the post-selection is done on the mixed state p. With this, equation (|39|) reads 

(t) = w{i^ , IV) ,fI--- F^Fm • • • Fi) + w{i2 , IV') ,fI-- fIFn ■■■F2) 

+ ■■■+ wiiN , Fn-1 • • • AIV') , fJ^Fn) (41) 

So the mean time-of-arrival is simply the sum of the contributions of each PMD section computed 
when forgetting about all others PMD's. This is quite natural since all PMD's are assumed to be 
weak measurements, which means they modify only slightly the shape of the pulse {6Tj « tc Vj). 

So we obtain an analytical formula for the mean time-of-arrival for an arbitrary concatenation 
of PMD and PDL elements. It should be stressed that the structure of this formula is very simple. 
In this sense we feel our analogy simplifies telecom calculation since the equivalent computation in 
the usual PMD-PDL language is less straightforward. 

Another important result of was that any concatenation of PMD and PDL elements is equivalent 
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to a simple setup where an effective PMD is followed by an effective frequency-dependent PDL. 
This is a consequence of the polar decomposition theorem for complex matrices, which states that 
any complex matrix T can be decomposed into a unitary matrix U and a positive Hermitian one A, 
so that T = AU. Unfortunately we were unable to recover this result in the quantum formalism. 



5.2 Optimal concatenation problem 

To illustrate the transparency of the structure of equation ()41|) . we discuss the following problem. 
How should one assemble a given number of PMD and PDL sections in order to maximize (or 
minimize) the mean time-of-arrival (t)? In other words which setup optimizes the interaction 
between PMD and PDL. For definiteness, we consider a five elements PMD-PDL-PMD-PDL-PMD 
network. We omit all rotations due to PMD sections, because they clearly play no role in our 
problem. Note that anyway, one could change the lengths of each PMD element so that the 
rotation due to PMD is a multiple of 2tt. Using equation (|41|) we have 



it) 



5ti {i;\FiFiF,ai\ 
2 \i 



+ 



5t2 {^P\FiFia2Fi 



WiFiFi 



5t3 

2 



(42) 



Numerical simulations show that in order to maximize (t), one has to align respectively all PMD's 
and all PDL's and choose the PDL axis orthogonal to the PMD axis. Now one question remains: 
should the PDL be distributed all along the network or simply grouped at the end of the setup. 
It turns out that the second choice is better (see Fig|31) and our formula clearly shows why. From 
the three terms of equation (|42|) the first one is the largest, since all filters contribute to the post- 
selection. But this first term is precisely the contribution of the setup where all the PDL is at the 
end. Thus {t) is maximized whenever the weight of the first term (say St/2) is the largest. This is 
of course the case when all PMD's sections are put together. Since they are all parallel, we have 
<5T/2 = Ef=i'^r,/2. 





2.1 2.2 2.3 2.4 2.5 2.6 



Figure 3: Mean time-of-arrival {t) versus polarization (we consider linear polarizations \tp) = 
cos(^/2)|/7) + sm{(p / 2)\V)) . In this example we used: Stj = 1, for j = 1,2,3 and fij = 1 for 
j = 1,2. (a) Five elements PMD-PDL-PMD-PDL-PMD network. The PDL is distributed along 
the line, (b) PMD(3)-PDL(2) setup. Ah PMD (PDL) sections are grouped at the beginning (at 
the end) of the setup. The time shift can be made nearly three times larger with this second setup. 
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6 Conclusion 



In conclusion, we have demonstrated that the formahsm of weak measurements with post-selection 
describes important polarization effects in the physics of telecom optics. It is quite nice and surpris- 
ing to see that telecom engineers and quantum theorists, two apparently completely unconnected 
categories of physicist, speak of the same things, each one in his own language. We also showed 
that the quantum formalism simplifies telecom calculations and gives a better understanding of the 
physics of networks. It must also be mentioned that with this work we close a loop of analogies. 
On the one hand, Gisin and Go showed in the strong analogy between PMD-PDL effects in 
networks, and the mixing and decay that are intrinsic to kaons. Remember that the kaon system is 
one of the most celebrated examples of a system evolving according to an effective non hermitian 
Hamiltonian. On the other hand, it was shown in that a system coupled to another suitably pre- 
and post-selected system can also evolve according to an effective non hermitian Hamiltonian. So 
our work closes the loop by showing the link between PMD-PDL effects and weak measurements 
with post-selection. 
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